CDS
Accession Number | TCMCG029C31244 |
gbkey | CDS |
Protein Id | XP_023734315.1 |
Location | join(257352..258177,258274..258370,258492..258594,258681..258734,258837..258911,258988..259056,259163..259246,259342..259536,259639..260162,260249..260378,260469..260560,260658..261039,261119..261220,261327..261489,261583..261875,261957..262094,262242..262321,262406..262505,262632..262761,262844..263028) |
Gene | LOC111882195 |
GeneID | 111882195 |
Organism | Lactuca sativa |
Protein
Length | 1273aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA432228 |
db_source | XM_023878547.1 |
Definition | DNA mismatch repair protein MSH6 isoform X4 [Lactuca sativa] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGCATTTCGTCGCCCAACCAACGGCCGATCGCCGCTCGTTAATCCACAACGTCAAATCACCTCTTTCTTCTCCAAATCACCATCATCAACTTCCTCCCTTTCTCCTTCCCAGTCACCATCCCCTCTTCTTTCTAACTCTAATTCGAACTCGAACTCTAAACTTAAACCTAAAACTAAACCTAGCCCTACAACCCCATCCCCTTTGCAAACCACAAGCAGTAAGAAACGGGCCTTGGTTATTGGTCAATCTTCTTCTACTCCCGCTTCCGATGCCCAAAACCCTAGATATGGCGACGAAGTAGTTAATCGAAGGATTAAGGTTTACTGGCCACTGGACAAGGCTTGGTATGAAGGCTGTGTCAAAGCTTTTGATAAGAGTTCGGGTAAGCATTTGGTTCAGTATGATGATGGTGAAGAGGAGCATTTGGATTTATCTAAAGAGAAGATTGAGTTGTTAAAGGAGCAGGCTAAAAGGTTCCGTAGGTTGAGGAAATTTTCCATTGAAGACGAAGATGATGATGAGGCTGGAGGCGGTGCCAAGGGGAACGTGGATAAGAATGTAGAAAGTGGAGGAGATGATTCTGATGATGAGGATTGGGGAATGCATGTCGAAAAGGAAGCTATTGACGATGAAATGGAGGATTTAGAGTTGGTAGACGAGAACGAGGAAGAGGAGGAGGAAGTGGAAGAAACGAAGGCAATAAAGCCGGACTCCAAAAAGCGGAAGGTTTTTGGAATGAAATCAGCCTCTCTTAAGAAAATCAAGAACGAGGCTCCGTTGGATTTAAGCCCGTGCAATTTGGAGCATAAAACCAATAATAACAGTGCGAAGGCATCTGCTTTTGTTGACAATGATCTTGTTGGCGATAAAGCTGAAAGATTTACCACAAGAGAAGAAGAGAAGTTCAAATTTCTTGGGAAAGCCCGGAAGGATGCCAAGAAGAGATCCCCTGATGATGAAAATTATGATTCAAGAACTCTATACTTGCCTCCTGACTTTTTGGAGAGTTTATCAGGTGGCCAGAGACAATGGTGGGAGTTCAAGTCACAACATATGGATAAGGTTCTATTTTTTAAGATGGGAAAGTTTTACGAGCTCTTTGAAATGGATGCACATGTTGGAGCAAAAGAACTTGATTTGCAATACATGAAGGGAGACCAGCCTCATTGTGGATTCCCAGAAAAGAACTTTGCACTAAATGTAGAGAAGTTGGCTCGCAAGGGTTACCGAGTTCTGGTTATTGAGCAGACAGAGACACCTGATCAGCTTGAGAGGCGTCGCAAAGAGCAGGGTACTAGAGATAAGGTTGTGAAAAGGGAAATATGTGGAGTAGTCACTAAAGGAACATTGGTTGATGGAGAAATGGTGGCAGCAAATCCGGATGCTTCTTATCTGTTTGCAGTTTCTGAATGCTATGAAGCATCAGGAAACCAACGTGATGATAGAATCTATGGTGTTTGTGTGGTTGATGTTGCTACAAGCAAGATCATGATAGGACAGTTTGGAGATGACACAGAGTGCAGTGTGCTGAGCTGCTTATTGTCCCAACTAAGACCAGTGGAAATCATAAAACCTGTCAAATCACTCAGCCCTGAAACTGAAAGAGTACTACTAAGACAAACAAGAAGCCCTGTCATAAATGAGTTGATACCACTTGAAGAGTTTTGGGATGCAGAGAAAACTATGTGTGAAGTGAAGGAGATTTATAAGCGCATCAGTAATCAATCATGTTTGAATGAATCCATGTCATGCTCATCTGATACTAAGGATTGCCTACCAGAAGTACTCTCTGATCTAATGAATACTGGAAATGTTGGTAGTTATGCTCTGTCAGCCCTTGGGGGCACTCTGTTCTACCTGAGAAAAGCCTTCCTGGACGAGTCATTGCTTCGATTTGCAAAGTTTGAGCGACTTCCTTGTTCTGGATTCAATGATTCCACCATAAAACCATACATGGTTCTTGATGCAGCTGCTCTAGAGAATCTTGAAGTTTTTGAGAACAGTGTAAATGGAGACTCTAAGGGGACATTGTATGAGCAACTAAATCGTTGTGTGACAGCATTTGGGAAGAGGTTGCTTAAAACATGGCTCTCTAGACCTTTATATCACATAGACTCAATCAGGGAACGCCAGAATGCTGTAGCTGGTGTAAAGGGAGTTAGCCTGCCTTATGCTCTTGAATTTCGTAAAGAGCTGTCCAAGCTTCCAGACATGGAGCGGTTGTTGGCACGCATCTTTTCTTGCAGTGAAGCTAATGGTAGGAATTCGAGTAAAGTGGTTCTGTATGAGGATGCATCAAAGAAACAACTTGAACATTTCATAATGGTTCTCAGTGGGTGTGAAGTAATTATAAATGCATGCTCCTCGCTAGGTGTCATTCTGGAAAACACTGATTCTAGGCTGCTGCATCACCTGTTAGCACCTGGTAAAGGTCTTCCGGATGTTGATGGTGTTCTTAGGCATTTCAAGGATGCTTTTGATTGGATGGAGGCAAAAAGTTCAGGGCGTATAATACCTCGTGATGGGGTTGATAAAGAATACGATACTGCTTGCGGAATGGTTACAGATATTGAGTTTAGTCTGAGAAAGCATTTGAAGGAACAGAGAAAACTTCTTGGAGATTCATCGATCAATTATGTTACTGTTGGAAAGGATACATATCTTCTTGAAGTAGCTGAAAGTTTGTCTGGTAGCGTTCCTTGTGAGTATGAGCGTCGATCATCTAAGAAGGGTTTTGTCCGATACTGGACTCCTGAAATTAGGAATTTGATGAGGGAGCTGTCAGAAGCTGAATCCGAGAAAGAGTCCAAGTTGAAAAGCATCATGCAGAGGTTGATTGGGCGGTTTTGTGAGCATCATGTGAGCTGGAGACAGTTGGTTTCTACAGCTGCAGAACTTGATGTCCTGATCAGCATAGCAATTGCGAGTGACATGTATGAAGGACCAACATGTCGTCCGCTTATAGTGGATTTGGATGGGGATGAAGCACCAGTTGTCGATGCTAAAAGTCTAGGGCATCCTGTGCTTGGAAATGATACTCTAGGGGATGGCAGTGGCAACTTTGTCCCAAATGATGTTTGTATTGGTGGGGCGGATCATGCCAGATTTATCGTGCTTACTGGTCCTAACATGGGTGGAAAGTCTACTCTTCTGCGTCAAGTTTGTTTAGCACTGATTTTGGCACAGGTGGGGGCAGATGTGCCTGCAGAAAGCTTTAAGATGTCTCCGGTTGATCGCATCTTTGTGAGGATGGGTGCAAAAGACCATATTATGGCAGGCCACAGTACATTTCTAACCGAGCTGCTGGAAACCGCATCCATGCTGTCATCAGCAACACGGAGTTCGGTTGTGGCATTAGATGAACTTGGACGAGGAACAGCAACATCAGATGGACAAGCCATAGCTGCATCGGTTCTTGAACACCTTGTGAACAAGGTCCAGTGTCGGGGTCTGTTTTCTACTCACTATCATCACTTAGCTTTGGAGTATCAGCAGATTGACAAGGTTTCCCTATGTCACATGGCATGCCAAGTTGGAGATGGAGATGGAGGTGTAGAGGAGGTAACATTTCTCTACAAATTGACACTTGGTGCATGCCCCAAAAGCTATGGTGTCAACGTTGCACGCCTAGCAGGACTTCCTGATGCCGTGCTTAAAAAGGCTGCAATTAAGTCCCAAGAGTTTGAGACAATGTATGGTAAAAGGACAAGGACAAACCAAAACCAGATAGCGCTCATGTTGCAGAGCTTAAACAATTGTCATGGGAATGGGATTCTTGATTTACAGAACAGGGCAAAGATATTTTTGGAGCACAAGTGA |
Protein: MAFRRPTNGRSPLVNPQRQITSFFSKSPSSTSSLSPSQSPSPLLSNSNSNSNSKLKPKTKPSPTTPSPLQTTSSKKRALVIGQSSSTPASDAQNPRYGDEVVNRRIKVYWPLDKAWYEGCVKAFDKSSGKHLVQYDDGEEEHLDLSKEKIELLKEQAKRFRRLRKFSIEDEDDDEAGGGAKGNVDKNVESGGDDSDDEDWGMHVEKEAIDDEMEDLELVDENEEEEEEVEETKAIKPDSKKRKVFGMKSASLKKIKNEAPLDLSPCNLEHKTNNNSAKASAFVDNDLVGDKAERFTTREEEKFKFLGKARKDAKKRSPDDENYDSRTLYLPPDFLESLSGGQRQWWEFKSQHMDKVLFFKMGKFYELFEMDAHVGAKELDLQYMKGDQPHCGFPEKNFALNVEKLARKGYRVLVIEQTETPDQLERRRKEQGTRDKVVKREICGVVTKGTLVDGEMVAANPDASYLFAVSECYEASGNQRDDRIYGVCVVDVATSKIMIGQFGDDTECSVLSCLLSQLRPVEIIKPVKSLSPETERVLLRQTRSPVINELIPLEEFWDAEKTMCEVKEIYKRISNQSCLNESMSCSSDTKDCLPEVLSDLMNTGNVGSYALSALGGTLFYLRKAFLDESLLRFAKFERLPCSGFNDSTIKPYMVLDAAALENLEVFENSVNGDSKGTLYEQLNRCVTAFGKRLLKTWLSRPLYHIDSIRERQNAVAGVKGVSLPYALEFRKELSKLPDMERLLARIFSCSEANGRNSSKVVLYEDASKKQLEHFIMVLSGCEVIINACSSLGVILENTDSRLLHHLLAPGKGLPDVDGVLRHFKDAFDWMEAKSSGRIIPRDGVDKEYDTACGMVTDIEFSLRKHLKEQRKLLGDSSINYVTVGKDTYLLEVAESLSGSVPCEYERRSSKKGFVRYWTPEIRNLMRELSEAESEKESKLKSIMQRLIGRFCEHHVSWRQLVSTAAELDVLISIAIASDMYEGPTCRPLIVDLDGDEAPVVDAKSLGHPVLGNDTLGDGSGNFVPNDVCIGGADHARFIVLTGPNMGGKSTLLRQVCLALILAQVGADVPAESFKMSPVDRIFVRMGAKDHIMAGHSTFLTELLETASMLSSATRSSVVALDELGRGTATSDGQAIAASVLEHLVNKVQCRGLFSTHYHHLALEYQQIDKVSLCHMACQVGDGDGGVEEVTFLYKLTLGACPKSYGVNVARLAGLPDAVLKKAAIKSQEFETMYGKRTRTNQNQIALMLQSLNNCHGNGILDLQNRAKIFLEHK |